Shortest Connection Networks 
And Some Generalizations 

By R. C. PRIM 

(Manuscript received May 8, 1957) 

The basic 'problem considered is that of interconnecting a given set of 
terminals with a shortest possible network of direct links. Simple and prac- 
tical procedures are given for solving this problem both graphically and 
computationally. It develops thai these procedures also provide sohdions 
for a much broader class of problems, containing other examples of practical 
interest, 

I. INTRODUCTION 

A problem of inherent interest in the planning of large-scale communi- 
cation, distribution and transportation networks also arises in connec- 
tion with the current rate structure for Bell System leased-line services. 
It is the following: 

Basic Problem — Given a set of (point) terminals, connect them by a 
network of direct terminal-to-terminal links having the smallest possible 
total length (sum of the link lengths). (A set of terminals is "connected," 
of course, if and only if there is an unbroken chain of links between every 
two terminals in the set.) An example of such a Shortest Connection Net- 
work is shown in Fig. 1. The prescribed terminal set here consists of 
Washington and the forty-eight state capitals. The distances on the par- 
ticular map used are accepted as true. 

Two simple construction principles will be estabhshed below which 
provide simple, straight-forward and flexible procedures for solving the 
basic problem. Among the several alternative algorithms whose validity 
follows from the basic construction principles, one is particularly well 
adapted for automatic computation. The nature of the construction 
principles and of the demonstration of their validity leads quite naturally 
to the consideration, and solution, of a broad class of minimization prob- 
lems comprising a non-trivial abstraction and generalization of the basic 
problem. This extended class of problems contains examples of practical 
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interest in quite difforont contexts from those in which the basic pi'ob- 
1cm had its genesis. 

II. (:f)NSTHUCTION PHIXf'IPLKS FOR SHORTEST COXNECTION NETWORKS 

In order to state the rules tor solution of the basic problem concisely, 
it is necessary to introduce u few, almost self-explanatory, terms. An 
isolated terminal is a terminal to which, at a given stage of the construc- 
tion, lu) connections have yet been made. (In Fig. 2, terminals 2, 4, and 
9 are the only isolated ones.) A fragmoU is a terminal subset connected 
by direct links, between members of the subset. (In Fig. 2, 8-3, 1-6-7-5, 
r)-(')-7, and l-fi are some of the fragments; 2-4, 4-8-3, 1-5-7, and 1-7 are 
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Fig. 2 — I'jirtijil connection notwoi'k. 

not fragments.) The tlistancc of a lenmnal from a Jraqment of which it 
is not an element is the minimum of its distances from the individual 
terminals comprising the fragment. An isolated fragment is a fragment 
to which, at a given stiige of the construction, no external (connections 
have been made. (In Fig. 2, 8-3 and l-()-7-5 are the only isolated frag- 
ments.) A nearest neighbor of a terminal is a terminal whose distance 
fi'om the specified terminal is at least as small as that of any other. A 
nearest neighbor of a fragment, analogously, is a terniinal whose distance 
from the specified fragment is at least as small as that of any other. 

The two fundamental construction principles (PI and P2) for shortest 
connection networks can now l)e stated as follows: 

Principle 1 — Any isolated terminal can be connected to a nearest 
neighbor. 

Principle 2 — Any isolated fragment can be connected to a nearest 
tieighbor by a shortest available link. 

For example, the next steps in the incomplete construction of Fig. 2 
couki be any one of the following: 

(1) add link 9-2 (PI applied to Term. 9) 
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(2) add link 2-9 (PI applied to Term. 2) 

(3) add link 4-8 (PI applied to Term. 4) 

(4) add link 8-4 (P2 applied to frag. 3-8) 

(5) add link 1-9 (P2 applied to frag. l-G-7-5). 

One possible sequence for completing this construction is: 4-8 (PI), 8-2 
(P2), 9-2 (PI), and 1-9 (P2). Another is: 1-9 (P2), 9-2 (P2), 2^8 (P2), 
and 8-4 (P2). 

As a second example, the construction of the network of Fig. 1 could 
have proceeded as follows: Olympia-Salem (PI), Salem-Boise (P2), Boise- 
Salt Lake City (P2), Plelena-Boise (PI), Sacramento-Carson City (PI), 
Carson City-Boise (P2), Salt Lake City-Denver (P2), Phoenlx-Santa Fe 
(PI), Santa Fe-Denver (P2), and so on. 

The kind of iutermLxture of appli(!ations of PI and P2 demonstrated 
here is very efficient when the shortest connection network is actually 
being laid out on a map on which the given terminal set is plotted to 
scale. With only a few minutes of practice, an example as complex as 
that of Fig. 1 can be solved in less than 10 minutes. Another mode of 
procedure, making less use of the flexibility permitted by the construc- 
tion principles, involves using PI only once to produce a single frag- 
ment, which is then extended by successive applications of P2 until the 
network is completed. This highly systematic variant, as will emerge 
later, has advantages for computer mechanization of the solution proc- 
ess. As applied to the example of Fig. 1, this algorithm would proceed 
as follows if Sacramento were the indicated initial terminal: Sacramento- 
Carson City, Carson City-Boise, Boise-Salt Lake City, Boise-Helena, 
Boise-Salem, Salem-Olympia, Salt Lake City-Denver, Denver-Cheyenne, 
Denver-Santa Fe, and so on. 

Since each application of either PI or P2 reduces the total number 
of isolated terminals and fragments by one, it is evident that an iV-ter- 
minal network is connected by A^-1 applications. 

HI. VALIDATION OF CONSTRUCTION' PRINCIPLES 

The validity of Pi and P2 depends essentially on the establishment 
of two necessary conditions (NCI and KC2) for a shortest connection 
network (SCN): 

Necessary Condition 1 — Every terminal in a SCN is directly con- 
nected to at least one nearest neighbor. 

Necessary Condition 2 — Every fragment in a SCN is connected to at 
least one nearest neighbor hy a shortest available path. 

To simplify the argument, it will at first be assumed that all distances 
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between terminals are different, so that each terminal or fragment has 
a single, uniquely dcfiiied, nearest neighbor. This restriction will be 
removed later. 

Consider first NCI. Suppose there is a SCN for which it is untrue. 
Then [Fig. 3(a)] some terminal, t, in this network is not directly joined 
to its nearest neighbor, n. Since the network is connected, i is necessarily 
joined directly to some one or more terminals other than n — say /i , 
• ■ ■ ,fr • For the same reason, n is necessarily joined through some chain, 
C, of one or more links to one of /i , • ■ -jfr — say to fk ■ Now if the link 
t — fk m removed from the network and the link t—nis added [Fig. 3(b)], 
the connectedness of the network is clearly not destroyed — fk being 
joined to t through n and C, rather than directly. However, the total 
length of the network has now been decreased, because, by hypothesis, 
/ — 71 is shorter than t—jk- Hence, contrary to the initial supposition, the 
network contradicting NCl could not have been the shortest, and the 
truth of NCl follows. From NCl follows PI, which merely permits the 
addition of links which NCl shows have to appear in the final SCN. 

Turning now to NC2, the above argument carries over directly if t 
is thought of as a fragment of the supposed contradictory SCN, rather 
than as an individual terminal ~ provided, of course that the t — n link 
substituted for t — fk is the shortest link from n to any of the terminals 
belonging to t. From the validity of NC2 foUows P2 — ^ again the links 
whose addition is permitted by P2 are all necessary, by NC2, in the 
final SCN. 

The temporary restrictive assumption that no two inter-terminal 
distances are identical mu.st now be removed. A reappraisal of the 
proofs of NCl and NC2 shows that they are still valid if n is not the 
only terminal at distance I — n from t, for in the supposedly contradictory 
network the distance / — ft must be greater than t — n. What remains to be 
established is that the length of the final connection network resulting 





Fig. 3 — Schematic demoDstration of NCl. 
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from successive applications (A'^ — 1 for A^ terminals) of PI and P2 is 
independent of which nearest neighbor is chosen for connection at a 
stage when more than one nearest neighbor to an isolated terminal or 
t is available. This is a consequence of the following considera- 
tions; For a prescribed terminal set there are only a Jinite number of 
connection networks (certainly fewer than Cn'^C'^''^ — the number of 
distinct waysof choosingA'" - 1 hnks from the total QiN(N — l)/2 possible 
links). The length of each one of this finite set of connection networks is 
a continuous function of the individual inter terminal distances, rf.-y (as a 
matter of f ct, it is a linear function with coefficients and 1). With 
the rf,j specified, the length, L, of a shortest connection network is 
simply the smallest length in this finite set of connection network 
lengths. Therefore L is uniquely determined. (It may, of course, be 
associated with more than one of the connection networks.) Now, if at 
each stage of construction employing PI and P2 at which a choice is to 
be made among two or more nearest neighbors Ui , • ■ ■ , n^ of an isolated 
terminal (or fragment) t, a small positive quantity, e, is subtracted from 
any specific one of the distances d,„^ , ■ ■ ■ , d,„, — say from di,,^ — the 
construction will be uniquely determined. The total length, L', of the 
resulting SCN for the modified problem will now depend on e, as well 
as on the d^j of the original terminal set. The dependence on e will be 
continuous, however, because the minimum of a finite set of continuous 
functions of e (the set of lengths of all connection networks of the modi- 
fied problem) is itself a continuous function of e. Hence, as e is made 
vanishingly small, L' approaches L, regardless of which "nearest neigh- 
bor" links were chosen for shortening to decide the construction. 

IV. ABSTRACTION AND GENERALIZATION 

In the examples of Figs. I and 2, the terminal set to be connected was 
represented by points on a distance-true map. The decisions involved 
in applying I'l and P2 could then be based on visual judgements of 
relative distances, perhaps augmented by application of a pair of di- 
viders in a few close instances. These direct geometric comparisons can 
of course, be replaced by numerical ones if the inter-terminal distances 
are entered on the terminal plot, as in Fig. 4(a). The application of PI 
and P2 goes through as before, with the relevant "nearest neighbors" 
determined by a comparison of numerical labels, rather than by a 
geometric st^anning process. For example. Pi applied to Terminal 5 of 
Fig. 4(a) yields the Link 5-6 of the SCN of Fig. 4(b), because 4.6 is 
less than 5.6, 8.0, 8.5, and 5.1, and so on. 



SHOHTEST COXXECTIOX NETWORKS 



1395 



Wlieu the construction of shortest connection networks is thus reduced 
to processes involving only the numerical distance labels on the various 

possil)Ic links, the actual iaration of the points representing the various 
terminals iu a graphical representation of the problem is, of course, 
inconsequential. The problem of Fig. 4(a) can just as well be represented 
by Fig. o(a), for example, and PI and P2 applied to obtain the SCN 
of Fig. 5(b). The original metric problem concerning a set of points in 
the plane has now been abstracted into a problem concerning labelled 
grnphti. The correspondence l)et\veen the lerminology employed thus 
far and more conventional language of Graph Theory is as follows: 

terminal <-> vertex 

possil>le link *-^ edge 

length of link <-> "length" (or "weight") of edge 

connection network ■<-»■ spanning subgraph 

(without closed loops) <-> (spanning subtree) 





(a) 

Fig. 4 — Example of a shortest spiiiining subtree of ji (complete labelled graph. 
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Fig. 5 — Example of a .shortest .spanning subtree of n complete labelled graph. 
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shortest connection network <-^ shortest spannmg subtree 

SCN ^ SSS 
It will be useful and worthwhile to carry over the concepts of "fragment" 
and "nearest neighbor" into the graph theoretic framework. Pi and P2 
can now be regarded as construction principles for finding a shortest 
spanning subtree of a labelled graph. 

In the originating context of connection networks, the graphs from 
which a shortest spanning subtree is to be extracted are cotnpJete graphs; 
that is, graphs having an edge between every pair of vertices. It is 
natural, now, to generalize the original problem by seeking shortest 
spanning subtrees for arbitrary connected labelled graphs. Consider, for 
example, the labelled graph of Fig. 6(a) which is derived from that of 
Fig. 5(a) by deleting some of the edges. (In the connection network 
context, this is equivalent to barring direct connections between certain 
terminal pairs.) It is easily verified that NCI and NC2, and hence PI 
and P2, are valid also in these more general cases. For the example of 
Fig. 6(a), they yield readdy the SSS of Fig. 6(b). 

As a further generalization, it is not at all necessary for the validity 
of PI and P2 that the edge "lengths" in the given labelled graph be 
derived, as were those of Figs. 4-6, from the inter-point distances of 
some point set in the plane. PI and P2 will provide a SSS for any con- 
nected labelled graph with any set of real edge "lengths." The "lengths" 
need not even be positive, or of the same sign. See, for example, the 
labelled graph of Fig. 7(a) and its SSS, Fig. 7(b). It might be noted in 
passing that this degree of generality is sufficient to include, among 
other things, shortest connection networks in an arbitrary number of 
dmrensions. 

A further extension of the range of problems solved by PI and P2 
follows trivially from the observation that the maximum of a set of 
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Fig. 6 — Example of a shortest spanning subtree of an incomplete labelled graph . 
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real numbers is the same as the negati\'e of the minimum of the negatives 
of the set. Therefore, PI and P2 can be used to construct a longest 
spannmg subtree by changing the signs of the "lengths" on the given 
luljelled graph. Fig. 8 gi\'es, as an example, the longest spanning subtree 
for the labelled graph of Figs. 4(a) and 5(a). 

It is ea.sy to extend the arguments in support of XCl and XC2 from 
the simple case of minimizing the sum to the more general problems of 
minimizing an arbitrary increasing symmetric function, or of maximizing 
an arbitrary decreasing symmetric function, of the edge "lengths" of a 
spanning subtree. (A sjTiimetric function of n variables is one whose 
value is unchanged by any interchanges of the variable values; e.g., 
Xi -^ X2 -\- ■ • ■ -\- x„ , Xi Xo ■ ■ ■ x„ , sin 2a-i + sin 2xn + ■ ■ ■ -\- sin 2.'C„ , 
(.^i' + x^ + ■ ■ ■ + Xn^Y"-, etc.) From this follow the non-trivial generali- 



zations. 



The shortest spanning subtree of a connected labelled graph 
also minimizes all increasing symmetric functions, and maxi- 
mizes all decreasing symmetric functions, of the edge "lengths." 




(a) 




Fig. 7 — ICxiLiuple of ;i .sli(irte.st spanning subtree of a Ijibelled graph with 
some "lengths" negative. 



®0 " 0© 




Fig. 8 — The longest spanning snbtree of the labeled graph of Figs. 4(a) and 
5(ii). 
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The longest spanning subtree of a connected labelled graph 
also maximizes all increasing symmetric functions, and mini- 
mizes all decreasing symmetric functions, of the edge "lengths." 

For example, Avith positive ''lengths" the same spanning sulitree that 
minimizes the sum of the edge "lengths" also minimizes the product and 
the square root of the sum of ike squares. At the same time, it maximizes 
the sum of the reciprocals and the product of the arc cotangents. 

It seems likely that these extensions of the original class of problems 
soluble by PI and P2 contain many examples of practical interest in 
quite different contexts from the original connection networks. A not 
entirely facetious example is the following: A message is to be passed 
to all members of a certain underground organization. Each member 
knows some of the other members and has procedures for arranging a 
rendezvous with anyone he knows. Associated with each such possible 
rendezvous — say between member "i" and member "/" — is a certain 
probability, p/j , that the message will fall into hostile hands. How is 
the message to be distributed so as to minimize the over-all chances of 
its being compromised? If members are represented as vertices, possible 
rendezvous as edges, and compromise probabilities as "length" labels 
in a labelled graph, the problem is to find a spanning subtree for which 
\ — n(l — p.j) is minimized. Since this is an increasing symmetric 
function of the pi/s, this is the same as the spanning subtree minimiz- 
ing 2 pij , and this is ca.sily found by PI and P2. 

Another application, closer to the original one, is that of minimizing 
the lengths of wire used in cabhng panels of electrical equipment. Re- 
strictions on the permitted wiring patterns lead to shortest connection 
network problems in w-hich the effective distances between terminals 
are not the direct terminal-to-terminal distances. Thus the more general 
viewpoint of the present section is applicable. 

V. COMPUTATIOXAL TECHNIQUE 

Return now to the exemplary shortest connection network problem 
of Figs. 4(a) and 5(a) which served as the center for discussion of the 
arithmetizing of the metric factors in the Basic Problem. It is evident 
that the actual drawing and labelling of all the edges of a complete 
graph will get very cumbersome as the number of vertices increases — 
an A''-vertex graph has {l/2)(N'^ — N) edges. For large N, it is convenient 
to organize the numerical metric information in the form of a distance 
table, such as Fig. 9, which is equivalent in content to Fig. 4(a) or Fig. 
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5(a). (A distance table can also be prepared to represent an incomplete 
labelled graph liy entering the length of non-existent edges as cc .) 

When it is dcsirt^d to determine a shortest connection network directly 
from the distaiK^e tal)le representation — either manually, or l)y machine 
computation — one of the numerous particular algorithms obtainable 
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Fig. 9 — Distance table equivalent of Figs. 4(a) and 5(a). 
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Fig, 10 — Illustrative computational determination of a shortest connection 
network. 
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by restricting the freedom of choice allowed by PI and P2 is distinctly 
superior to other alternatives. This variant is the one in which PI is 
used but once to produce a single isolated fragment, which is then ex- 
tended I)y repeated applications of P2. 

The successive steps of an efficient computational procedure, as ap- 
plied to the example of Fig. 9, are shown in Fig. 10. The entries in the 
top rows of the successive F tables are the distances from the connected 
fragment to the unconnected termmals at each stage of fragment growth. 
The entries in parentheses in the second rows of these tables indicate 
the nearest neighbor m the fragment of the external terminal in question. 
The computation is started by entering the first row of the distance 
table into the F table (to start the growing fragment from Terminal 1). 
A smallest entry in the F table is then selected (in this case, 2.8, asso- 
ciated with External Terminal 4 and Internal Terminal 1). The hnk 1-4 
is deleted from the F table and entered in the Solution Summary (Fig. 
11). The remaining entries in the first stage F table are then compared 
with the corresponding entries in the "4" row of the distance table 
(reproduced beside the first F table). If any entry of this "added ter- 
minal" distance table is smaller than the corresponding 'F table entry, 
it is substituted for it, with a corresponding change in the parenthesized 
index. (Since 3.4 is loss than 5.2, the 3 column of the /'"' table becomes 
3.4/(4).) This process is repeated to yield the list of successive nearest 
neighbors to the growing fragment, as entered in Fig. 11. The F and 
"added terminal" distance tables grow shorter as the number of un- 
connected terminals is decreased. 

This computational procedure is easily programmed for an automatic 
computer so as to handle quite large-scale problems. One of its advan- 
tages is its avoidance of checks for closed cycles and connectedness. 
Another is that it never requires access to more than two rows of distance 
data at a time — no matter how large the problem. 
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Fig. 11 — Solution summary for computation of Fig. 10. 
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VI. RELATED LITEKATURE AND PROBLEMS 

J. B. Kiuskal, Jr.' di.scus.se.s the problem of constructing shortest 
spainiing .sul>trees for labelled grajihs. He considers only distinct and 
positive sets of edge lengths, and is primarily interested in establishing 
uniquene.ss under these conditions. (This follows immediately from XCl 
and NC2.) lie also, however, gi\'es three diffei"cut constructions, or 
algorithms, for finding SSS's. Two of these are contained as special 
cases in PI — P2. The third is — "Perform the following .step as many 
times as po.ssible: Among the edges not yet chosen, choose the longest 
edge whose removal will not disconnect them" While this is not directly 
a special case of PI — P"2, it is an obvious corollary of the special case 
in which the shortest of the edges permitted by PI — P2 is selected at 
each stage. Kruskal refers to an ob.scure Czech papei^ as giving a con- 
struction and uniqueness proof inferior to his. 

The simplicity and powoi' of the solution afforded by PI and P2 for 
the Ba.sic Problem of the present paper comes as something of a surprise, 
because there arc well-known prolilems which scetn quite similar in 
nature for which no efficient solution procedure is known. 

One of these is Stciner's Problem : Find a shortest connection network 
for a given terminal set, with freedom to add additional terminals 
whei'ever desired. A number of necessary properties of thcac networks 
are known^ hut do not lead to an effective .solution procedure. 

Another is the Traveling Salesman Problem: Find a closed path of 
minimum length connecting a prescribed terminal set. Nothmg even 
approaching an effective solution procedure for this problem is now 
known (early 1957). 
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